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Abstract. It is well known that the dynamical mechanism ol decoherence 
may cause apparent superselection rules, like that of molecular chirality. These 
'environment-induced' or 'soft' superselection rules may be contrasted with 'hard' 
superselection rules, like that of electric charge, whose existence is usually rigorously 
demonstrated by means of certain symmetry principles. We address the question of 
whether this distinction between 'hard' and 'soft' is well founded and argue that, 
despite first appearance, it might not be. For this we first review in detail some of 
the basic structural properties of the spaces of states and observables in order to 
establish a fairly precise notion of superselection rules. We then discuss two exam- 
ples: 1.) the Bargmann superselection rule for overall mass in ordinary quantum 
mechanics, and 2.) the superselection rule for charge in quantum electrodynamics. 

1 Introduction 

To explain the (apparent) absence of interferences between macroscopically 
interpretable states - like states describing spatially localized objects - is the 
central task for any attempt to resolve the measurement problem. First at- 
tempts in this direction just imposed additional rules, like that of the Copen- 
hagen school, who defined a measurement device as a system whose state- 
space is classical, in the sense that the superposition principle is fully broken: 
superpositions between any two states simply do not exist. In a more modern 
language this may be expressed by saying that any two states of such a sys- 
tem are disjoint, i.e., separated by a superselection rule (see below). Proper 
quantum mechanical systems, which in isolation do obey the superposition 
principle, can then inherit superselection rules when coupled to such classical 
measurement devices. 

Whereas there can be no doubt that the notion of classicality, as we 
understand it here, is mathematically appropriately encoded in the notions of 
disjointness and superselection rules, there still remains the physical question 
how these structures come to be imposed. In particular, if one believes that 
fundamentally all matter is described by some quantum theory, there is no 
room for an independent classical world. Classicality should be a feature that 
is emerging in accordance with, and not in violation of, the basic rules of 
quantum mechanics. This is the initial credo of those who believe in the 
program of decoherence p6|, which aims to explain classicality by means of 
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taking into account dynamical interactions with ambient systems, like the 
ubiquitous natural environment of the situation in question. (Note: It is not 
claimed to resolve the full measurement problem.) This leads to the notion 
of 'environment-induced superselection rules' po[ . 

Fundamental to the concept of dynamical decoherence is the notion of 
'derealization' The intuitive idea behind this is that through some dy- 
namical process certain state characteristics ('phase relations'), which were 
locally accessible at one time, cease to be locally accessible in the course 
of the dynamical evolution. Hence locally certain superpositions cannot be 
verified anymore and an apparent obstruction to the superposition principle 
results. Such mechanisms are considered responsible for the above mentioned 
environment-induced superselection rule, of which a famous physical exam- 
ple is that of molecular chirality (see e.g. |38 and references therein). It has 
been established in many calculations of realistic situations that such dy- 
namical processes of derealization can be extremely effective over short time 
scales. But it is also intuitively clear that, mathematically speaking, it will 
never be strict in any finite time. Hence one will have to deal with notions of 
approximate- respectively asymptotic (for t — > oo) superselection rules and 
disjointness of states [ ^T]p^ |, which needs some mathematical care. 

Since for finite times such dynamical superselection rules are only approx- 
imately valid, they are sometimes called 'soft'. In contrast, 'hard' superse- 
lection rules are those which are rigorously established mathematical results 
within the kinematical framework of the theory, usually based on symmetry 
principles (see section 3 below), or on first principles of local QFT, like in the 
proof for the superselection rule for electric charge J3j| . Such presentations 
seem to suggest that there is no room left for a dynamical origin of 'hard' 
superselection rules, and that hence these two notions of superselection rules 
are really distinct. However, we wish to argue that at least some of the ex- 
isting proofs for 'hard' superselection rules give a false impression, and that 
quite to the contrary they actually need some dynamical input in order to be 
physically convincing. We will look at the case of Bargmann's superselection 
rule for total mass in ordinary quantum mechanics (which is clearly more of 
an academic example) and that of charge in QED. The discussion of the latter 
will be heuristic insofar as we will pretend that QED is nothing but quan- 
tum mechanics (in the Schrodinger representation) of the infinite-dimensional 
(constrained) Hamiltonian system given by classical electrodynamics. For a 
brief but general orientation on the subject of superselection rules and the 
relevant references we refer to Wightman's survey |3{|. 

Let us stress again that crucial to the ideas presented here is of course 
that 'delocalized' does not at all mean 'destroyed', and that hence the loss 
of quantum coherence is only an apparent one. This distinction might be 
considered irrelevant FAPP (for all practical purposes) but it is important 
in attempts to understand apparent losses of quantum coherence within the 
standard dynamical framework of quantum mechanics. 
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As used here, the term 'local' usually refers to locality in the (classical) 
configuration space Q of the system, where we think of quantum states in the 
Schrodinger representation, i.e., as L 2 -functions on Q. Every parametrization 
of Q then defines a partition into 'degrees of freedom'. Locality in Q is a more 
general concept than locality in ordinary physical space, although the latter 
forms a particular and physically important special case. Moreover, on a 
slightly more abstract level, one realizes that the most general description of 
why decoherence appears to occur is that only a restricted set of so-called 
physical observables are at ones disposal, and that with respect to those the 
relevant 'phase relations' apparently fade out of existence. It is sometimes 
convenient to express this by saying that decoherence occurs only with respect 
(or relative) to a 'choice' of observables [^tJ. Clearly this 'choice' is not meant 
to be completely free, since it has to be compatible with the dynamical laws 
and the physically realizable couplings (compare (In this respect the 

situation bears certain similarities to that of 'relevant' and 'irrelevant' degrees 
of freedom in statistical mechanics.) But to fully control those is a formidable 
task - to put it mildly. In any case it will be necessary to assume some a priori 
characterizations of what mathematical objects correspond to observables, 
and to do this in such a general fashion that one can effectively include 
superselection rules. This will be done in the next section. 

2 Elementary Concepts 

In this section we wish to convey a feeling for some of the concepts underlying 
the notion of superselection rules. We will take some care and time to do 
this, since many misconceptions can (and do!) arise from careless uses of 
these concepts. To gain intuition it is sometimes useful to dispense with some 
technicalities associated with infinite dimensions and continuous spectra and 
just look at finite dimensional situations; we will follow this strategy where 
indicated. We use the following, generally valid notations: H denotes a Hilbert 
space, B(H) the algebra of bounded operators on 7i. The antilinear operation 
of taking the hermitean conjugate is denoted by * (rather than f), which 
makes B(H) a ^-algebra. Given a set {A\} where A € A (= some index set), 
then by {^4a}' we denote the 'commutant' of {A\} in B(H), defined by 

{A x }' := {B e B(H) | BA\ = A X B, VA e A}. (1) 

Note that if the set {A\} is left invariant under the *-map (in this case we 
call the set 'self-adjoint'), then is a *-subalgebra of B(TL). Also, the 

definition (|l|) immediately implies that 



A c B => B' C A'. 



(2) 



4 Domenico Giulini 



2.1 Superselection Rules 

There are many different ways to give a structural definition of superselection 
rules. Some stress the notion of state others the notion of observable. Often 
this dichotomy seems to result in very different attitudes towards the fun- 
damental significance of superselection rules. This really seems artificial in a 
quantum mechanical context. In quantum field theory, i.e., if the underlying 
classical system has infinitely many degrees of freedom, the situation seems 
more asymmetric. This is partly due to the mathematical difficulties to de- 
fine the full analog of the Schrodinger representation, i.e., to just construct 
the Hilbert space of states as L 2 space over the classical configuration space. 
In this paper we will partly ignore this mathematical difficulty and proceed 
heuristically by assuming that such a Schrodinger representation (of QED) 
exists to some level of rigour. 

In traditional quantum mechanics, which stresses the notion of state, a 
system is fundamentally characterized by a Hilbert space, 7i, the vectors 
of which represent (pure) states. We say 'represent' because this labeling 
by states through vectors is redundant: non-zero vectors which differ by an 
overall complex number label the same state, so that states can also be labeled 
by rays. We will use VH to denote the space of rays in H. In many cases of 
interest this Hilbert space is of course just identified with the space of L 2 - 
functions over the classical configuration space. Now, following the original 
definition given by W 3 J35|] , we say that a superselection rule operates on 
7i, if not all rays represent pure states, but only those which lie entirely in 
certain mutually orthogonal subspaces Hi C H, where 



The only rays which correspond to pure states are those in the disjoint union 



Since no vector which lies skew to the partition (g) can, by assumption, rep- 
resent a pure state, the superposition principle must be restricted to the Tti- 
Moreover, since observables map pure states to pure states, they must leave 
the TLi invariant and hence all matrix-elements of observables between vectors 
from different sectors vanish. The Tii are called coherent sectors if the observ- 
ables act irreducibly on them, i.e., if no further decomposition is possible; this 
is usually implied if a decomposition ([}]) is written down. States which lie in 
different coherent sectors are called disjoint. Note that disjointness of states 
is essentially also a statement about observables, since it means orthogonal- 
ity of the original states and the respective states created from those with 
all observables. The existence of disjoint states is the characteristic feature 
of superselection rules. 



(3) 




(4) 
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From this we see that a partition (g) into coherent sectors implies that the 
set of physical observables is strictly smaller than the set of all self-adjoint 
(w.l.o.g. bounded) operators on H. It can be characterized by saying that 
observables are those self adjoint operators on H which commute with the 
orthogonal projectors Pi : H — ► Hi. So the Pi are themselves observables and 
generate the center (see below) of the algebra of observables. 

This suggests a 'dual', more algebraic way to look at superselection rules, 
which starts with the algebra of observables O. Then superselection rules are 
said to occur if the algebra of observables, O, - which we think of as being 
given by bounded operators on some Hilbert space ftfl - has a non-trivial 
center O c . Recall that 

O c :={AeO\AB = BA, VB e O}. (5) 

Suppose O c is generated by self-adjoint elements {C^, p = 1,2,..} which have 
simultaneous eigenspaces Hi, then the Hi's are just the coherent sectors. 
Indeed, as already remarked, matrix elements of operators from O between 
states from different coherent sectors (i.e. differing in the eigenvalue of at 
least one C^) necessarily vanish. Thus if <f>\ and 4>2 are two non-zero vectors 
from Hi and Hj with i ^ j, their superposition <j) '■— <f>i + 4>2 defines a state 
whose density matrix p := P^ (=orthogonal projector onto the ray generated 
by 4>) satisfies 

tv{ P A) = tr((A lPl + \ 2 P2)A), VA e O, (6) 

where Ai^ = H^i^HVll^ll 2 and p\^ — Pfa 2 - This means that p is a non-pure 
state of 0, since it can be written as a non-trivial convex combination of other 
density matrices. Hence we come back to the statements expressed by ([|) and 
. Also note the following: in quantum mechanics the decomposition of a 
non-pure density matrix as a convex combination of pure density matrices - 
the so-called extremal decomposition - is generically not unique, thus pre- 

1 In Algebraic Quantum Mechanics one associates to each quantum system an 
abstract C*-algebra, C, which is thought of as being the mathematical object that 
fully characterizes the system in isolation, i.e. its intrinsic or 'ontic' properties. 
But this is not yet what we call the algebra of observables. This latter algebra 
is not uniquely determined by the former. It is obtained by studying faithful 
representations of C in some Hilbert space TC, such that C can be identified with 
some subalgebra of B(TC) (the bounded operators on Ti.). This is usually done 
by choosing a reference state (positive linear functional) on C and performing 
the GNS construction. Then C inherits a norm which is used to close C (as 
topological space) in B(TL). It is this resulting algebra which corresponds to our 
O. Technically speaking it is a von Neumann algebra which properly contains an 
embedded copy of C. The added observables (those in O — C) do not describe 
intrinsic but contextual properties. For example, it may happen that O has non- 
trivial center whereas C doesn't. In this case the superselection rules described 
by O are contextual. See |3ll] for a more extended discussion of this point. 
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venting the (ignorance-) interpretation as statistical "mixtures" .[] However, 
for the special density matrices of the form p = \cj))((f>\, where \<j>) £ Ti., the 
extremal decomposition is unique and given by <fi — A;P0 i , where <pi is 
the orthogonal projection of <p into Tii, P<p i the orthogonal projector onto 
cf>i's ray, and A^ = ||<Ai|| 2 /||<^|| 2 - This is the relevance of superselection rules 
for the measurement problem: to produce unique extremal decompositions 
- and hence statistical 'mixtures' in the proper sense of the word - into an 
ensemble of pure states. There is a long list of papers dealing with the mathe- 
matical problem of how superselection sectors can arise dynamically; see e.g. 
[ pj|j30| ,3,p6| and the more general discussions in [ ^7plj] . 

2.2 Dirac's Requirement 

Dirac was the first who spelled out certain rules concerning the spaces of 
states and observables Q. He defined the notion of compatible (i.e., simul- 
taneously performable) observations, which mathematically are represented 
by a set of commuting observables, and the notion of a complete set of such 
observables, which is meant to say that there is precisely one state for each 
set of simultaneous "eigenvalues". Starting from the hypothesis that states 
are faithfully represented by rays, Dirac deduced that a complete set of such 
mutually compatible observables existed. But this only makes sense if all the 
observables in question have purely discrete spectra. 

In the general case one has to proceed differently: We heuristically define 
Dirac's requirement as the statement, that there exists at least one complete 
set of mutually compatible observables and show how it can be rephrased 
mathematically so that it applies to all cases. In doing this we essentially 
follow Jauch's exposition (22). To develop a feeling for what is involved, we 
will first describe some of the consequences of Dirac's requirement in the most 
simplest case: a finite dimensional Hilbert space. We will use this insight to 
rephrase it in such a way to stay generally valid in infinite dimensions. 

Gaining intuition in finite dimensions. So let Tt be an rt-dimensional 
complex Hilbert-space, then B(Ti) is the algebra of complex n x n matrices. 
Physical observables are represented by hermitean matrices in B(Ti), but 
we will explicitly not assume the converse, namely that all hermitean matri- 
ces correspond to physical observables. Rather we assume that the physical 
observables are somehow given to us by some set S of hermitean matrices. 
This set does not form an algebra, since taking products and complex linear 
combinations does not preserve hermiticity. But for mathematical reasons it 
would be convenient to have such an algebraic structure, and just work with 
the algebra O generated by this set, called the algebra of observables. [Note 

2 Hence the term 'mixture' for a non-pure state is misleading since we cannot tell 
the components and hence have no ensemble interpretation. For this reason we 
will say 'non-pure state' rather than 'mixture'. 
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the usual abuse of language, since only the hermitean elements in O are ob- 
servables.] But for this replacement of S by O to be allowed S must have been 
a set of hermitean matrices which is uniquely determined by O, for otherwise 
we can not reconstruct the set S from O. To grant us this mathematical con- 
venience we assume that S was already maximal, i.e. that S already contains 
all the hermitean matrices that it generates. But we stress that there seems 
to be no obvious reason why in a particular practical situation the set of 
physically realizable observables should be maximal in this sense. 

We may choose a set {Oi, . . . O m } of hermitean generators of O. Then O 
may be thought of as the set of all complex polynomials in these (generally 
non-commuting) matrices. But note that we need not consider higher powers 
than (n — 1) of each Oi, since each complex n x n matrix O is a zero of its 
own characteristic polynomial po, i.e. satisfies po(0) = 0, by the theorem of 
Cayley-Hamilton. Since this polynomial is of order n, O n can be re-expressed 
by a polynomial in O of order at most (n— 1). For example, the *-algebra 
generated by a single hermitean matrix O can be identified with the set of 
all polynomials of degree at most (n — 1) and whose multiplication law is as 
usual, followed by the procedure of reducing all powers n and higher of O via 
Po(O) = 0. 

Now let {Ai, ■ ■ ■ , A m } =; {Ai} be a complete set of mutually commuting 
observables. It is not difficult to show that there exists an observable A and 
polynomials pi, i = \,---,m such that Ai — Pi(A) (see Q for a simple 
proof). This actually means that the algebra generated by {Ai} is just the n- 
dimensional algebra of polynomials of degree at most n—1 in A (see below for 
justification), which we call A. This algebra is abelian, which is equivalently 
expressed by saying that A is contained in its commutant (compare ([l])): 



A C A' 'A is abelian' (7) 



Now comes the requirement of completeness. In terms of A it is easy to 
see that it is equivalent to the condition that A has a simple spectrum (i.e. 
the eigenvalues are pairwisc distinct). This has the following consequence: 
Let B be an observable that commutes with A, then B is also a function 
of A, i.e., pb(B) = A for some polynomial ps- The proof is simple: We 
simultaneously diagonalizc A and B with eigenvalues a a and (3 a , a = 1, • • • , n. 
We wish to find a polynomial of degree n—1 such that ps(a a ) = (3 a . Writing 
Pb(x) — a n -ii" _1 + ■ • ■ + ao, this leads to a system of n linear equations 
(a b a := 6 th power of a a ) 

n-l 

a b a a,b = Pa, for a = 1, • • • , n (8) 

b=0 

for the n unknowns (aa, ■ ■ ■ , a n -i)- Its determinant is of course just the Van- 
dermonde determinant for the n tuple {ax, ■ ■ ■ ,a n ): 

det{o&} = l[(a a -a b ), (9) 

a<b 
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which is non-zero if and only if (=iff) A's spectrum is simple. This implies 
that every observable that commutes with A is already contained in A. (It 
follows from this that the algebra generated by {Ai] is equal to, and not 
just a subalgebra of, the algebra generated by {^4}, as stated above.) Since a 
*-algebra is generated by its self-adjoint elements (observables) , A cannot be 
properly enlarged as abelian *- algebra by adding more commuting generators. 
In other words, A is maximal. Since A! is a *-algebra, this can be equivalently 
expressed by 



A' C A 'A is maximal' (10) 



Equations (0) and ([To]) together are equivalent to Dirac's condition, which 
can now be stated in the following form, first given by Jauch |^2j : the algebra 
of observables O contains a maximal abelian ^-subalgebra A C O, i.e., 



Dirac's requirement, 1 st version: 3 A C O satisfying A = A' (11) 



This may seem as if Dirac's requirement could be expressed in purely 
algebraic terms. But this is deceptive, since the very notion of 'commutant' 
(compare (Q)) makes reference to the Hilbert space Tt through B(7i). Without 
further qualification the term 'maximal' always means maximal in B(Ti.)^ 

This reference to TL can be further clarified by yet another equivalent 
statement of Dirac's requirement. Since A consists of polynomials in the 
observable A, which has a simple spectrum, the following is true: there exists 
a vector \g) £ TL, such that for any vector <fi € TL there exists a polynomial 
p,p such that 

p*(A)\g) = (12) 

Such a vector \g) is called a generating or cyclic vector for A in TL. The 
proof is again very simple: let {<j)\, • • • , <fin} be the pairwise distinct, non-zero 
eigenvectors of A (with any normalization); then choose 

i$> = X». ( 13 ) 

2 = 1 

Equation ( p"2| ) now defines again a system of n linear equations for the n 
coefficients a„_i, • • • , ao of the polynomial p^, whose determinant is again 
the Vandermonde determinant (0) for the n eigenvalues ot\, - ■ ■ ,a n of A. 
Conversely, if A had an eigenvalue, say a±, with eigenspace TLi of two or 
higher dimensions, then such a cyclic \g) cannot exist. To see this, suppose it 
did, and let \4>i) € Hi be orthogonal to the projection of \g) into Hi- Then 
(4>i\p(A)g) = for all polynomials p. Thus \4>i) is unreachable, contradicting 
our initial assumption. Hence a simple spectrum of A is equivalent to the 
existence of a cyclic vector. 

3 The condition for an abelian A C O to be maximal in O would be A = A' PI 
O. Such abelian subalgebras always exist (use Zorn's Lemma to show this), in 
contrast to those A C O which satisfy the stronger condition to be maximal in 
the ambient algebra B(Ti), which need not exist for a given O C B(Ti.). 
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The general case. In infinite dimensions we have to care a little more 
about the topology on the space of observables, since here there are many 
inequivalent ways to generalize the finite dimensional case. The natural choice 
is the so-called 'weak topology', which is characterized by declaring that a 
sequence {A{\ of observables converges to the observable A if the sequence 
(4>\Ai\ip) of complex numbers converges to (0|.A|?/>) for all |</>), \ip) € Ti. Hence 
one also requires that the algebra of observables is weakly closed (i.e., closed 
in the weak topology). Such a weakly closed *-subalgebra of B(TL) is called 
a W*- or von-Neumann-algebra (we shall use the first name for brevity). 

A crucial and extremely convenient point is, that the weak topology is 
fully encoded in the operation of taking the commutant (see @)), in the 
following sense: Let jA\} be any subset of B(H), then {^4a}' is automatically 
weakly closed (see ]22| p 716 for a simple proof) and hence a lT*-algebra. 
Moreover, the weak closure of a ^-algebra A C B(TL) is just given by A" 
(the commutant of the commutant). Hence we can characterize a W'-algebra 
purely in terms of commutants: A is W* iff A = A" . 

This allows to easily generalize the notion of 'algebra generated by observ- 
ables': Let {0\} be a set of self-adjoint elements in B(Tt), then O :— {0\}" 
is called the (W*-) algebra generated by this set. This definition is natural 
since {0\}" is easily seen to be the smallest H / *-algebra containing {0\}, for 
if {0\} C B C O for some W^*-algebra B, then taking the commutant twice 
yields B = O.FI 



Now we see that Dirac's requirement in the form (11) directly translates to 
the general case if all algebras involved (i.e. A and O) are understood as W*- 
algebras. Now we also know what a 'complete set of (bounded) commuting 
observables' is, namely a set {A\} C B(H) whose generated IT*-algebra 
A := {^4a}" is maximal abelian: A — A'. This latter condition is again 
equivalent to the existence of a cyclic vector \g) £ TL for A, where in infinite 
dimensions the definition of cyclic is that {„4|g)} is dense in (rather than 
equal to) Ti. It is also still true that there is an observable A such that all 
A\ are functions (in an appropriate sense, not just polynomials of course) 
of A Q. But since A's spectrum may be (partially) continuous, there is no 
direct interpretation of a 'simple' spectrum as in finite dimensions. Rather, 
one now defines simplicity of the spectrum of A by the existence of a cyclic 
vector for A = {A}". 

Now we come to our final reformulation of Dirac's condition. Namely, 
looking at (|lTJ) , we may ask whether we could not reformulate the existence 
of such a maximal abelian A purely in terms of the algebra of observables O 
alone. This is indeed possible. We have ACO^O'Ci'=iC0, hence 
CCO. Since O = O" the last condition is equivalent to saying that O' is 
abelian (C C O"), or to saying that O' is the center O c of O, since by (Q) and 



4 Note: for any M C B{TL) definition (jl|) immediately yields M C M" and hence 
M' D M'" (by (§)). But also M' C M'" (by replacing M M'); therefore 
M' = M'" for any M C B(H). 
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(||) the center can be written as O c — O n O' . Now, conversely, it was shown 
in that an abelian O' implies the existence of a maximal abelian A C O. 
Hence we have the following alternative formulation of Dirac's requirement, 
first spelled out, independently of (^lj), by Wightman |37j], who called it the 
'hypothesis of commutative superselection rules': 



Dirac's requirement, 2 version: O' is abelian (14) 



There are several interesting ways to interpret this condition. From its 
derivation we know that it is equivalent to the existence of a maximal abelian 
AC O. But we can in fact make an apparently stronger statement, which also 
relates to the earlier footnote |j| namely: (|l4|) is equivalent to the condition, 
that any abelian A C O that is maximal in O, i.e. satisfies A = A' D O, is 
also maximal in _B(7i).^] 



2.3 Dirac's condition and gauge symmetries 



Another way to understand (|14|) is via its limitations on gauge-symmetries. 
To see this, we mention that any Vy*-algebra is generated by its unitary 
elements. Hence O' is generated by a set {U\} of unitary operators. Each 
U\ commutes with all observables and therefore generates a one-parameter 
group of gauge-transformations. Condition (|l4|) is then equivalent to saying 
that the total gauge group, which is generated by all U\, is abelian. Note also 
that an abelian O' implies that the gauge-algebra, {U\}" — O', is contained 
in the observables, O' C O" = O, so that O' — O c . From this one can infer 
the following central statement: 



Dirac's requirement implies that gauge- and 
sectorial structures are fully determined by the 
center O c of the algebra of observables O. 



(15) 



To see in what sense this is true we remark that for Vy*-algebras we can 
simultaneously diagonalize all observables in O c . That means that we can 
write TL in an essentially unique way as direct integral over the real line of 
Hilbert spaces 7i(A) using some (Lebesgue-Stieltjes-) measure a: 



H= / da(\)H(\). 

Jr 



(16) 



Operators in O respect this decomposition in the sense that each O £ O acts 
on 7i componentwise via some bounded operator 0(\) on Tt(\). If O S O c 

5 Proof: We need to show that (C abelian) <S> (A = A' n O A = A'). '=>': O' 
abelian implies O' C O" = O and A C O implies O' C A', so that O'C/nO. 
Hence A — A' tl O implies C C A, which implies A' C O" = O, and hence 
A = A'. (A = A' n O => A — A') is equivalent to A' C O, which implies 
O' C ^4" = and hence that O is abelian. 
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then each 0(A) is a multiple 0(A) S C of the unit operator. Moreover, the 
set of all {0(A)} induced from O for each fixed A acts irreducibly on 7i(A).[] 
Hence, provided that Dirac's requirement is satisfied, (^) is the generally 
valid version of (^J). The notion of disjointness now acquires an intuitive 
meaning: two states l^q) and |<Pa) are separated by a superselection rule (are 
disjoint), iff their component-state-functions A — > |-0i(A)) and A — * 1^2 (A)) 
have disjoint support on R (up to measure-zero sets). Note that by spec- 
tral decomposition the superselection observables can be decomposed into 
the projectors in O c , which for (|l6| ) are all given by multiplications with 
characteristic functions %(A) for er-measurable sets in R. 



Non-abelian gauge groups We have seen that the fulfillment of Dirac's 
requirement allows to give a full structural characterisation for the spaces of 
(pure) states and observables. How general is this result? Does it exclude cases 
of physical interest? At first glance this seems indeed to be the case: just con- 
sider a situations with non-abelian gauge groups; for example, the quantum 
mechanical system of n > 2 identical spinless particles with n-particle Hilbert 
space H = L 2 (R 3 ™) on which the permutation group G = S n of n objects acts 
in the obvious way by unitary operators U(g). That these particles are iden- 
tical means that observables must commute with each U(g). Without further 
restrictions on observables one would thus define O := {U(g), g E G}'. Hence 
O' is the H / *-algebra generated by all U(g), which is clearly non-abelian, thus 
violating (jlj). But does this generally imply that general particle statistics 
cannot be described in a quantum-mechanical setting which fulfills Dirac's 
requirement? The answer to this question is 'no'. Let us explain why. 

If we decompose H according to the unitary, irreducible representations 
of G wc obtain (@0|) 

p(n) 

H = 0H„ (17) 

i=l 

where i labels the p(n) inequivalent, unitary, irreducible representations Z), 
of G of dimension di. Each Hi has the structure Hi = C di ®Hi, where G acts 
irreducibly via Di on C di and trivially on Hi whereas O acts irreducibly via 
some ^-representation Wi on Hi and trivially on C d * . Hi and TTj are inequivalent 
if i ^ j. Hence we see that Hi furnishes an irreducible representation for O, iff 
di = 1, i.e., for the Bose and Fermi sectors only. Pure states from these sectors 
are just the rays in the corresponding Hi. In contrast, for di > 1, given a 
non-zero vector \<f>) € Hi, all non-zero vectors in the (^-dimensional subspace 
C di ® \4>) CHi define the same pure state, i.e., the same expectation- value- 
functional on O. Furthermore, a vector in Hi = C dt §Z>Hi which is not a pure 

6 It is this irreducibility statement which depends crucially on the fulfillment of 
Dirac's requirement. In general, the 0(A)'s will act irreducibly on 7i(A) for each 
A, iff O c is maximal abelian in O' , i.e., iff O c = (O c )' n O' . But we already saw 
that (O) also implies O c = C so that this is fulfilled. 



12 Domenico Giulini 



tensor product defines a non-pure state, since the restriction of O S O to 
Hi is of the form 1 eg) O, which means that a vector in Hi defines a state 
given by the reduced density matrix obtained by tracing over the left (i.e. 
C di ) state space. From elementary quantum mechanics we know that the 
resulting state is pure, iff the vector in Hi was a pure tensor product (i.e. 
of rank one). Hence in those Hi where di > 1 not all vectors correspond to 
pure states, and those which do represent pure states in a redundant fashion 
by higher dimensional subspaces, sometimes called 'generalized rays' in the 
older literature on parastatistics |2§| . 

However, the factors C d ' are completely redundant as far as physical 
information is concerned, which is already fully encoded in the irreducible 
representations iti of O on Hi] no further physical information is contained 
in dj-fold repetitions of 7Tj. Hence we can define a new, truncated Hilbert 
space 

p(n) 

H:=0Wi. (18) 

i=l 



This procedure has also been called 'elimination of the generalized ray' in 
the older literature on parastatistics jL8| - see also H] for a more recent 
discussion of this point. Since every pure state in H is also contained in H, just 
without repetition, these two sets are called 'phenomenological equivalent' in 
the literature on QFT (e.g. in chapter 6.1.C of j|]). The point is that pure 
states are now faithfully labelled by rays in the Hi and that O' - where the 
commutant is now taken in B(H) rather than B(H) - is generated by 1 and 
the pin) (commuting!) projectors into the His. Hence Dirac's requirement is 
satisfied. But clearly the original gauge group has no action on H anymore, 
but there is also no physical reason why one should keep itj^| It served to 
define 0, but then only its irreducible representations TTi are of interest. 
Only a residual action of the center of G still exists, but the gauge group 
generated by the projectors into the Hi consists in fact of the continuous 
group of p(n) copies of U(l), one global phase change for each sector. Its 
meaning is simply to induce the separation into the different sectors (Hi, 71$), 
and that in accordance with Dirac's requirement. 

To sum up, we have seen that even if a theory is initially formulated via 
non-abelian gauge groups, we can give it a physically equivalent formula- 
tion that has at most a residual abelian gauge group left and hence obeys 
Dirac's requirement. Hence the 'obvious' counterexamples to Dirac's require- 
ment turn out to be harmless. This is generally true in quantum mechanics, 



7 In jl(| Dirac's requirement together with the requirement that the physical 
Hilbert space must carry an action of the gauge group has been used to "prove" 
the absence of parastatistics. In our opinion there seems to be no physical reason 
to accept the second requirement and hence the "proof" ; compare fiM and fl4| . 



Decoherence and Superselection 13 



but in quantum field theory there are genuine possibilities to violate Dirac's 
condition which we will ignore herej^J 

3 Superselection Rules via Symmetry Requirements 

The requirement that a certain group must act on the set of all physical 
states is often the (kinematical) source of superselection rules. Here I wish to 
explain the structure of this argument. 

Note first that in quantum mechanics we identify the states of a closed 
system with rays and not with vectors which represent them (in a redundant 
fashion). It is therefore not necessary to require that a symmetry group G 
acts on the Hilbert space H, but rather it is sufficient that it acts on VH, 
the space of rays, via so-called ray-representations. Mathematically this is a 
non-trivial relaxation since not every ray-representation of a symmetry group 
G (i.e. preserving the ray products) lifts to a unitary action of G on TL. What 
may go wrong is not that for a given g S G we cannot find a unitary (or 
anti-unitary) operator U g on 7i; that is assured by Wigner's theorem (see 
[|| for a proof). Rather, what may fail to be possible is that we can choose 
the U g 's in such a way that we have an action, i.e., that U Sl U S2 = U gig2 . 
As is well known, this is precisely what happens for the implementation of 
the Galilei group in ordinary quantum mechanics. Without the admission 
of ray representations we would not be able to say that ordinary quantum 
mechanics is Galilei invariant. 

To be more precise, to have a ray-representation means that for each 
g G G there is a unitary^] transformation U g which, instead of the usual 
representation property, are only required to satisfy the weaker condition 

U 91 U S2 =exp(^(gi,ff 2 ))U ffig2 , (19) 

for some function ^:GxG^R, called multiplier exponent, satisfying^ 

£(i s ff) = £GM) = o, (20) 
£(91,92) - £(gi,9293) + £(gm,93)-£(92,gs) = o. (21) 

The second of these conditions is a direct consequence of associativity: 
U sl (U ff2 U 93 ) = (U Sl U 92 )U g3 . Obviously these maps project to an action of 

8 An abelian O' implies that O is a von Neumann algebra of type I (see , chap- 
ter 8) whereas truly infinite systems in QFT are often described by type III 
algebras. 

9 For simplicity we ignore anti-unitary transformations. They cannot arise if, for 
example, G is connected. 

10 The following conditions might seem a little too strong, since it would be sufficient 
to require the equalities in (20) and ( |2~lj ) only mod 2ir; this also applies to (p2]). 
But for our application in section 4 it is more convenient to work with strict 
equalities, which in fact implies no loss of generality; compare 132]]. 
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G on VH. Any other lift of this action on VH onto H is given by a redefini- 
tion U g — > U 9 := exp(i7(g))U 9! for some function 7 : G — > R with 7(1) = 0, 
resulting in new multiplier exponents 

¥{91,92) =€(gi,, 92) + 7(51) -7(3132) + 7(52), (22) 

which again satisfy (20) and (|l|). The ray representations U and U' are then 
said to be equivalent, since the projected actions on VH are the same. We 
shall also say that two multiplier exponents £, £' are equivalent if they satisfy 
( p2| ) for some 7. 

We shall now see how the existence of inequivalent multiplier exponents, 
together with the requirement that the group should act on the space of 
physical states, may clash with the superposition principle and thus give rise 
to superselection rules. For this we start from two Hilbert spaces H 1 and H" 
and actions of a symmetry group G on VH and VH" , i.e., ray representations 
U' and U" on H' and H" up to equivalences @. We consider H = H' ® H" 
and ask under what conditions does there exist an action of G on TH which 
restricts to the given actions on the subsets VH' and VH" ■ Equivalently: 
when is U = U' U" a ray representation of G on H for some choice of 
ray-representations U' and U" within their equivalence class? To answer this 
question, we consider 

u 91 u 92 = (u 3i eu;;)(u 32 eu; 2 ) 

= cxp^'( 5l ,. 92 ))U; i92 ©exp(r(3i,3 2 ))U;' l92 (23) 

and note that this can be written in the form (filf), for some choice of 
within their equivalence class, iff the phase factors can be made to coin- 
cide, that is, iff £' and £" are equivalent. This shows that there exists a 
ray-representation on H which restricts to the given equivalence classes of 
given ray representations on H' and H' , iff the multiplier exponents of the 
latter are equivalent. Hence, if the multiplier exponents £' and £" are not 
equivalent, the action of G cannot be extended beyond the disjoint union 
VH' U VH" . Conversely, if we require that the space of physical states must 
support an action of G, then non-trivial superpositions of states in H' and 
Ti." must be excluded from the space of (pure) physical states. 

This argument shows that if we insist of implementing G as symmetry 
group, superselection rules are sometimes unavoidable. A formal trick to avoid 
them would be not to require G, but a slightly larger group, G, to act on the 
space of physical states. G is chosen to be the group whose elements we label 
by (0,g), where 6 EM., and the multiplication law is 

3i32 = 31)^2,32) = {Ox +9 2 +t{gi, 92), 9x92)- (24) 

It is easy to check that the elements of the form (9, 1) lie in the center of G 
and form a normal subgroup = R which we call Z. Hence G/Z — G but G 
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need not be a subgroup of G. G is a central K extension^ of G (see e.g. 
[ [j2| ) . Now a ray-representation U of G on TL defines a proper representation 
U of G on TC by setting 

C/ (e , 9) := ex P (*'0)U ff . (25) 

Then G is properly represented on H' and H" and hence also on H = H'®TL". 
The above phenomenon is mirrored here by the fact that ^ acts trivially on 
VH' and but non-trivially on VH, and the superselection structure 

comes about by requiring physical states to be fixed points of Z's action. 



4 Bargmann's Superselection Rule 

An often mentioned textbook example where a particular implementation 
of a symmetry group allegedly clashes with the superposition principle, such 
that a superselection rule results, is Galilei invariant quantum mechanics (e.g. 
[||; see also Wightman's review j39|). We will discuss this example in detail 
for the general multi-particle case. (Textbook discussions usually restrict to 
one particle, which, due to Galilei invariance, must necessarily be free.) It will 
serve as a test case to illustrate the argument of the previous chapter and also 
to formulate our critique. Its physical significance is limited by the fact that 
the particular feature of the Galilei group that is responsible for the existence 
of the mass superselection rule ceases to exist if we replace the Galilei group 
by the Poincare group (i.e. it is unstable under 'deformations'). But this is not 
important for our argument. Let now G be the Galilei group, an element 
of which is parameterized by (R, v, a, 6), with R a rotation matrix in 50(3), 
v the boost velocity, a the spatial translation, and b the time translation. Its 
laws of multiplication and inversion are respectively given by 

9ig 2 = (Ri,vx,ax,b 1 )(R 2 ,v 2 ,a 2 ,b 2 ) 

= {R\R 2 , vi + R% ■ v 2 , a x + Ri ■ a 2 + v ib 2 , h + b 2 ), (26) 
rf 1 = (R, v, a, by 1 = [Rr l , -R- 1 ■ v , -iT 1 ■ (a - vb) , ~b). (27) 

We consider the Schrodinger equation for a system of n particles of positions 
Xi, masses TOj, mutual distances ry := — Xj\\ which interact via a Galilei- 
invariant potential V({rij}), so that the Hamilton operator becomes H = 
-h 2 Y,t + Th e Hilbert space is H = L 2 (R 3n ,d 3 x 1 ■ ■ ■ d 3 x n ). 

G acts on the space {configurations} x {times} = R 3n+1 as follows: Let g = 
(R, v, a, b), then g({xi}, t) := ({R-Xi+vt+a} , t+b). Hence G has the obvious 

11 Had we defined the multiplier exponents mod 2tt (compare footnote p"o| ) then we 
would have obtained a (7(1) extension, which would suffice so far. But in the 
next section we will definitively need the R extension as symmetry group of the 
extended classical model discussed there. 

12 In General Relativity, where the total mass can be expressed as a surface integral 
at 'infinity', the issue of mass superselection comes up again; see e.g. pM and M. 
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left action on complex- valued functions on M 3n+1 : (g, ip) — > ipo g^ 1 . However, 
these transformations do not map solutions of the Schrodinger equations into 
solutions. But, as is well known, this can be achieved by introducing an 
K. 3n+1 -dependent phase factor (see e.g. [O] for a general derivation). We set 
M = rrii for the total mass and r c = jj X)j m i x i f° r the center-of-mass. 
Then the modified transformation, T s , which maps solutions (i.e. curves in 
H) to solutions, is given by 

T g iP({ Xi }, t) := exp (£M[w • (r c - a) - \v\t - b)}) VOT^i}. t)). (28) 

However, due to the modification, these transformations have lost the prop- 
erty to define an action of G, that is, we do not have T Sl o T g2 — T Sl£(2 . 
Rather, a straightforward calculation using (^6|) and (^) leads to 

T si oT S2 = exp(^(3i,3 2 ))T gig2 , (29) 

with non-trivial multiplier exponent 

£(9i,92) = f(v 1 -R 1 -a 2 + lvlb 2 ). (30) 

Although each T 9 is a mapping of curves in TL 1 it also defines a unitary 
transformation on Ji itself. This is so because the equations of motion define 
a bijection between solution curves and initial conditions at, say, t = 0, which 
allows to translate the map T g into a unitary map on 7i, which we call U 9 . 
It is given by 

U 9 V({*i}) = exp (iM[v ■ (r c - a) + \v 2 b]) exp(-| ^({iT 1 (xi-a+vb)}), 

(31) 

and furnishes a ray-representation whose multiplier exponents are given by 
(|30|). It is easy to see that the multiplier exponents are non-trivial, i.e., not 
removable by a redefinition (f22|). The quickest way to see this is as follows: 
suppose to the contrary that they were trivial and that hence ( p2] ) holds 
with £' = 0. Trivially, this equation will continue to hold after restriction to 
any subgroup Go C G. We choose for Go the abelian subgroup generated by 
boosts and space translations, so that the combination 7(51)— 7(51 g 2 ) + 7(52) 
becomes symmetric in gi,g 2 € G . But the exponent (|o|) stays obviously 
asymmetric after restriction to Go- Hence no cancellation can take place, 
which contradicts our initial assumption. 

The same trick immediately shows that the multiplier exponents are in- 
equivalent for different total masses M. Hence, by the general argument given 
in the previous chapter, if Ti' and TL" correspond to Hilbert spaces of states 
with different overall masses M' and M", then the requirement that the 
Galilei group should act on the set of physical states excludes superpositions 
of states of different overall mass. This is Bargmann's superselection rule. 

I criticize these arguments for the following reason: The dynamical frame- 
work that we consider here treats 'mass' as parameter(s) which serves to 
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specify the system. States for different overall masses are states of differ- 
ent dynamical systems, to which the superposition principle does not even 
potentially apply. In order to investigate a possible violation of the super- 
position principle, we must find a dynamical framework in which states of 
different overall mass are states of the same system; in other words, where 
mass is a dynamical variable. But if we enlarge our system to one where mass 
is dynamical, it is not at all obvious that the Galilei group will survive as 
symmetry group. We will now see that in fact it does not, at least for the 
simple dynamical extension which we now discuss. 

The most simple extension of the classical model is to maintain the Hamil- 
tonian, but now regarded as function on an extended, 6n + 2n - dimensional 
phase space with extra 'momenta' rrii and conjugate generalized 'positions' 
Aj. Since the Ai's do not appear in the Hamiltonian, the m^s are constants 
of motion. Hence the equations of motion for the x^s and their conjugate 
momenta Pi are unchanged (upon inserting the integration constants m,) and 
those of the new positions Aj are 

dV n 2 

M*) = 7T- - A, (32) 
om,i 2m* 

which, upon inserting the solutions {xi(t),Pi(t)}, are solved by quadrature. 

Now, the point is that the new Hamiltonian equations of motion do not 
allow the Galilei group as symmetries anymore. But they do allow the R- 
extension G as symmetries p3| . Its multiplication law is given by (p3) , with £ 



as in (30). The action of G on the extended space of {configurations} x {times} 



is now given by 

g({xi},{\i},t) = (9,R,v,a,b)({xi},{Xi},t) 

= {{R Xi + vt + a}, {A, - (jg6 + v ■ R ■ xt + \v 2 t)} ,t + b). (33) 

With H|) and © it is easy to verify that this defines indeed an action. 
Hence it also defines an action on curves in the new Hilbert space Ti. := 
L 2 (R in ,d 3n xd n \), given by 

fgil>:=1>og- 1 , (34) 

which already maps solutions of the new Schrodinger equation to solu- 
tions, without invoking non-trivial phase factors. This is seen as follows: Let 
^({xi}, {Ai}, t) G H and ^({a;^}, {m,}, t) its Fourier transform in the (Aj, mi) 
arguments: 



$({ Xi }, {AJ, t) = (2irh)- n / 2 f d n m exp 

JW l 



y^TOjAj 



h 

i=l 



<P({Xi},{ mi },t). 



(35) 

For each set of masses {mj the function 4>s m .\({xi}, t) := ^({a;,-}, {m«}, t) 
satisfies the original Schrodinger equation. Since (03) does not mix different 
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sets of {rrii} it induces a map T^™^ for each such set: 

Tl m ' } <P {mt} ({ Xl },t) : = exp [tf + \M (v ■ (r c - a) - \v\t - b))] 

X *{m 4 }(S _1 ({*i}>*)) (36) 

Via the Fourier transform ([55]) we represent H as direct integral of 7ij mi \'s, 
each of which isomorphic to our old H = L 2 (R 3 ", d 3 a;i • • • d 3 x n ), and on each 
of which (|36|) defines a unitary representation U of G the form (|25| ) with U g 
the ray-representation (^H). This shows how the much simpler transformation 
law ([54]) contains the more complicated one (|2|) upon writing H as a direct 
integral of vector spaces Tis^x. 

In the new framework the overall mass, M, is a dynamical variable, and 
it would make sense to state a superselection rule with respect to it. But now 
G rather than G is the dynamical symmetry group, which acts by a proper 
unitary representation on 7i, so that the requirement that the dynamical 
symmetry group should act on the space of physical states will now not lead 
to any superselection rule. Rather, the new and more physical interpretation 
of a possible superselection rule for M would be that we cannot localize 
the system in the coordinate conjugate to overall mass, which we call A, 
i.e., that only the relative new positions A^ — Xj are observable.^ (This is 
so because M generates translations of equal amount in all Aj.) But this 
would now be a contingent physical property rather than a mathematical 
necessity. Note also that in our dynamical setup it is inconsistent to just state 
that M generates gauge symmetries, i.e. that A corresponds to a physically 
non existent degree of freedom. For example, a motion in real time along 
A requires a non- vanishing action (for non- vanishing M), due to the term 
J dt MA in the expression for the action. 

If decoherence were to explain the (ficticious) mass superselection rule, it 
would be due to a dynamical instability (as explained in p3 ) of those states 
which are more or less localized in A. Mathematically this effect would be 
modelled by effectively removing the projectors onto /1-subintervalls from the 
algebra of observables, thereby putting M (i.e. its projectors) into the center 
of O. Such a non-trivial center should therefore be thought of as resulting 
from an approximation-dependent idealisation. 



A system {{Xi,rhi}) of canonical coordinates including M = Xii m » ' s e -S- ^1 := 
Ai, mi = M and As = A, — Ai, rhi = rrn for i = 2...n. Then A = Ai. 
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5 Charge Superselection Rule 

In the previous case I said that superselection rules should be stated within a 
dynamical framework including as dynamical degree of freedom the direction 
generated by the superselected quantity. What is this degree of freedom in 
the case of a superselected electric charge and how does it naturally appear 
within the dynamical setup? What is its relation to the Coulomb field whose 
role in charge-decoherence has been suggested in [|l5| ? In the following discus- 
sion I wish to investigate into these questions by looking at the Hamiltonian 
formulation of Maxwell's equation and the associated canonical quantization. 

In Minkowski space, with preferred coordinates {x 11 — (t,x,y, z)} (labo- 
ratory rest frame), we consider the spatially finite region Z = {(t,x,y,z) : 
x 2 + y 2 + z 2 < R 2 }. £ denotes the intersection of Z with a slice t = const, 
and dS =: Sr its boundary (the laboratory walls). Suppose we wish to solve 
Maxwell's equations within Z , allowing for charged solutions. It is well known 
that in order for charged configurations to be stationary points of the action, 
the standard action functional has to be supplemented by certain surface 
terms (see e.g. pj]]) which involve new fields on the boundary, which we call 
A and /, and which represent a pair of canonically conjugate variables in 
the Hamiltonian sense. On the laboratory walls, dS, we put the boundary 
conditions that the normal component of the current and the tangential com- 
ponents of the magnetic field vanish. Then the appropriate boundary term 
for the action reads 

/ dtdu(\ + 0)/, (37) 
J z 

where (f> is the scalar potential and cko the measure on the spatial boundary 2- 
sphcre rescaled to unit radius. Adding this to the standard action functional 
and expressing all fields on the spatial boundary by their multipole moments 
(so that integrals J as dujR 2 , dw = measure on unit sphere, become Y^i m )> 
one arrives at a Hamiltonian function 

H = [ [±(£ 2 + (V x A) 2 ) +<f>( p -V-E)-A-j]+y] <Pi m (E lm - f lm ). 

Js lm 

(38) 

Here the pairs of canonically conjugate variables are (A(x), —E(x)) and 
(Xirm flm), and Ei m are the multipole components of n • E, 

Ei m := f duR 2 Y lm n ■ E, (39) 
Jas 

where n is the normal to dS. The scalar potential <fi has to be considered as 
Lagrange multiplier. With the given boundary conditions the Hamiltonian is 
differentiable with respect to all the canonical variables^] and leads to the 

14 This would not be true without the additional surface term (|37|). Without it 
one does not simply obtain the wrong Hamiltonian equations of motions, but 
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following equations of motion 

SH 

m 

-E = -—=j- Vx (Vx A), (41) 
OH 

km = 777 — = -<t>lm » (42) 
OJhn 

f lm = -^—=0. (43) 

These are supplemented by the equations which one obtains by varying with 
respect to the scalar potential </>, which, as already said, is considered as 
Lagrange multiplier. Varying first with respect to <f>(x) (i.e. within S) and 
then with respect to <fri m (i.e. on the boundary dS), one obtains 



G(x) : = V • E(x) - p(x) = 0, (44) 

Ghn ■ = Ei, n — fi m = 0. (45) 

These equations are constraints (containing no time derivatives) which, once 
imposed on initial conditions, continue to hold due to the equations of motion. 

□ 

This ends our discussion of the classical theory. The point was to show that 
it leaves no ambiguity as to what its dynamical degrees of freedom are, and 
that we had to include the variables A; m along with their conjugate momenta 
fim in order to gain consistency with the existence of charged configurations. 
The physical interpretation of the A; m 's is not obvious. Equation ( [42| ) merely 
relates their time derivative to the scalar potential's multipolc moments on 
the boundary, which are clearly highly non-local quantities. The interpreta- 
tion of the fim's follow from ( f45| ) and the definition of Ei m , i.e. they are the 
multipole moments of the electric flux distribution (p(n) := R 2 n ■ E(R 2 n). 
In particular, for I = = m we have 

/oo = (4vr)-^Q, (46) 

none at all! Concerning the Langrangean formalism one should be aware that the 
Euler-Lagrange equations may formally admit solutions (e.g. with long-ranged 
(charged) fields) which are outside the class of functions which one used in the 
variational principle of the action (e.g. rapid fall-off). Such solutions are not 
stationary points of the action and their admittance is in conflict with the vari- 
ational principle unless the expression for the action is modified by appropriate 
boundary terms. 

15 Equation (^) together with charge conservation, p + V ■ j = 0, shows that (|SJ) 
is preserved in time, and (^) together with the boundary condition that n ■ j 
and n x (V x A) vanish on dS show that (kq) is preserved in time. 
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where Q is the total charge of the system. Hence we see that the total charge 
generates motions in Aoo- But this means that the degree of freedom labelled 
by Aoo truly exists (in the sense of the theory). For example, a motion along 
Aoo will cost a non- vanishing amount of action oc Q(AQQ al — A Q Itlal ). A dec- 
laration that Aoo really labels only a gauge degree of freedom is incompatible 
with the inclusion of charged states. Similar considerations apply of course to 
the other values of l,m. But note that this conclusion is independent of the 
radius R of the spatial boundary 2-sphere d£. In particular, it continues to 
hold in the limit R — > oo. We will not consistently get rid of physical degrees 
of freedom that way, even if we agree that realistic physical measurements 
will only detect field values in bounded regions of space-time. See for 
more discussion on this point and the distinction between proper symmetries 
and gauge symmetries. 

It should be obvious how these last remarks apply to the statement of 
a charge superselection rule. Without entering the technical issues (see e.g. 
|53)), its basic ingredient is Gauss' law (for operator- valued quantities), lo- 
cality of the electric field and causality. That Q commutes with all (quasi-) 
local observables then follows simply from writing Q as surface integral of 
the local flux operator R 2 n ■ E, and the observation that the surface may 
be taken to lie in the causal complement of any bounded space-time region. 
Causality then implies commutativity with any local observable. 

In a heuristic Schrodinger picture formulation of QED one represents 
states ^ by functions of the configuration variables A(x) and A; m . The mo- 
mentum operators are obtained as usual: 

-^--^< < 47 > 
flm — -i^- ■ (48) 

In particular, the constraint ( fl5| ) implies the statement that on physical states 
iS> we have0 

QV= -iy/4^J?—&. (49) 

(Moo 

This shows that a charge superselection rule is equivalent to the statement 
that we cannot localize the system in its Aoo degree of freedom. Removing by 
hand the multiplication operator Aoo (he. the projectors onto Aoo-intervals) 
from our observables clearly makes Q a central element in the remaining 
algebra of observables. But what is the physical justification for this removal? 
Certainly, it is valid FAPP if one restricts to local observations in space-time. 
To state that this is a fundamental restriction, and not only an approximate 



16 Clearly all sorts of points are simply sketched over here. For example, charge 
quantization presumably means that Aoo should be taken with a compact range, 
which in turn will modify (^8|) and (^(j) . But this is irrelevant to the point stressed 
here. 
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one, is equivalent to saying that for some fundamental reason we cannot 
have access to some of the existing degrees of freedom, which seems at odds 
with the dynamical setup. Rather, there should be a dynamical reason for 
why localizations in Aoo seem FAPP out of reach. The idea of decoherence 
would be that localizations in Aoo are highly unstable against dynamical 
decoherence. 

We have mainly focussed on the charge superselection operator /oo, al- 
though the foregoing considerations make it clear that by the same argument 
any two different asymptotic flux distributions also define different supers- 
election sectors of the theory. Do we expect these additional superselection 
rules to be physically real? First note that for I > the fi m are not directly 
related to the multipole moments of the charge distributions, as the latter 
fall-off faster than A and are hence not detectable on the sphere at infin- 
ity. Conversely, the higher multipole moments fi m are not measurable (in 
terms of electromagnetic fields) within any finite region of space-time, unlike 
the charge, which is tight to massive particles; any finite sphere enclosing all 
sources has the same total flux. But the fi m can be related to the kinematical 
state of a particle through the retarded Coulomb field. In fact, given a par- 
ticle with constant momentum p, charge e and mass m, one obtains for the 
electric flux distribution at time t on a sphere centered at the instantaneous 
(i.e. at time t) particle position^ 

em 2 [p 2 + m 2 ]? 
<Pp( n ) = -JTTi iTa- ( 51 ) 

Hence different incoming momenta would induce different flux distributions 
and therefore lie in different sectors. Given that these sectors exist this means 



Formula (^) requires a little more explanation: for a particle with general tra- 
jectory z(t) let il be the retarded time for the space-time point (x,t), i.e., 
t' = t — \\x — z(t')\\ (c = 1 in our units). Now we can use the well known 
formula for the retarded electric field (e.g. (14.14) in and compute the flux 
distribution on a sphere which lies in the space of constant time t, where it is 
centered at the retarded position z(t') of the particle. This flux distribution can 
be expressed as function of the retarded momentum p' := p(t') and the retarded 
direction n' := [as - z(t')]/\\x - z(t')\\ as follows (E' := vV + ™ 2 ): 

^V("0 = ^ [E ,_* , n , ]2 - (50) 

If the particle moves with constant velocity v := z, the expression for the retarded 
Coulomb field can be rewritten in terms of the instantaneous position z(t) by 
using z(t) — z(t') + v\\x — z(t')\\. With respect to this center it is purely radial. 
Then one calculates the flux distribution on a sphere which again lies in the space 
of constant time t, but now centered at z(t) rather than z{t'). This function can 
be expressed in terms of the instantaneous direction n := fas — z(t)]/||a; — z(t)\\ 
and the instantaneous momentum p := p(t). One obtains (pl|) . 
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that different incoming momenta cannot be coherently superposed and no 
incoming localized states be formed, unless one also adds the appropriate 
incoming infrared photons to just cancel the difference of the asymptotic flux 
distributions. This is achieved by imposing the 'infrared coherence condition 
of Zwanziger [ pd^ the effect of which is to 'dress' the charged particles with 
infrared photons which just subtract their retarded Coulomb fields at large 
spatial distances. Hence coherent superpositions of particles with different 
momenta can only be formed if they are dressed by the right amount of 
incoming infrared photons. 

As a technical aside we remark that this can be done without violating the 
Gupta-Bleuler transversality condition fc^a M (fc)|in) = in the zero-frequency 
limit, precisely because of the surface term (p7|)pi[. This resolved an old is- 
sue concerning the compatibility of the infrared coherence condition on one 
hand, and the Gupta-Bleuler transversality condition on the other 
From what we said earlier concerning the consistency of the variational prin- 
ciple in the presence of charged states, such an apparent clash of these two 
conditions had to be expected: without the surface variables one cannot main- 
tain gauge invariance at spatial infinity (i.e. in the infrared limit) and at the 
same time include charged states. In the charged sectors the longitudinal 
infrared photons correspond to real physical degrees of freedom and it will 
naturally lead to inconsistencies if one tries to eliminate them by imposing 
the Gupta-Bleuler transversality condition also in the infrared limit. How- 
ever, a gauge symmetry in the infrared limit can be maintained if one adds 
the asymptotic degrees of freedom in the form of surface terms. 

These remarks illustrate how the rich superselection structure associ- 
ated with different asymptotic flux distributions fi m renders the problem 
of characterizing state spaces in QED for charged sectors fairly complicated. 
This problem has been studied within various formalisms including algebraic 
QFT S and lattice approximations, where the algebra of observables can be 
explicitly presented [E5|. However, all this takes for granted the existence of 
the superselection rules, whereas we would like to see whether they really 
arise from some physical impossibility to localize the system in the degrees 
of freedom labelled by Xi m - What physics should prevent us from forming 
incoming localized wave packets of charged undressed (in the sense above) 
particles, which would produce coherent superpositions of asymptotic flux 
distributions from the sectors with I > 1? This cries out for a decoherence 
mechanism to provide a satisfying physical explanation. The case of charge 
superselection is, however, more elusive, since localizations in Aoo do not have 
an obvious physical interpretation. Compare the controversy between ||i|,p9| 
on one side and IM on the other. 



Basically it says that the incoming scattering states should be eigenstates to the 
photon annihilation operators a'^(k) in the zero-frequency limit. 
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